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Abstract. In this paper, we give a necessary condition for an infinite word defined by a non- 
degenerate interval exchange on three intervals (3iet word) to be invariant by a substitution: a 
natural parameter associated to this word must be a Sturm number. We deduce some algebraic 
consequences from this condition concerning the incidence matrix of the associated substitution. 
As a by-product of our proof, we give a combinatorial characterization of Siet words. 



1. Introduction 

The original definition of a Sturm number using continued fractions was introduced in 1993 
when Crisp et al. [12] showed that a homogeneous sturmian word (i.e., a sturmian word with 
slope e and intercept xq = 0) is invariant under a non-trivial substitution if and only if e is a 
Sturm number. In 1998, Allauzen [4 provided a simple characterization of Sturm numbers: 

A quadratic irrational number e with conjugate e' is called a Sturm number if 

(1) eG(0,l) and e' ^ (0, 1) . 



For general sturmian words (with arbitrary intercept xq), the fact that e is a Sturm number 
is only a necessary but not a sufficient condition for invariance under a substitution; this is clear 
since there can be only a countable number of such invariant words, while the sturmian words 
with a given slope are determined by their intercept, hence they are uncountable in number. 
For a complete characterization, see [20] [6l |8]. 

In this paper we study invariance under substitution of infinite words coding non-degenerate 
exchange of three intervals with permutation (321)j^ These words, which are here called non- 
degenerate Siet words, are one of the possible generalizations of sturmian words to a three- 
letter alphabet. Some combinatorial properties characterizing the language of Siet words are 
described in [13]. It is well known that substitutive Siet words, that is, Siet words that are 
image by a morphism of a fixed point of a substitution correspond to quadratic parameters, see 
e.g. [H 121 dni [TSl [18] . Let us stress the fact that we consider in the present paper fixed points 
of substitutions and not substitutive words. 

Sturmian words can be equivalently defined as aperiodic words coding a rotation, that is, an 
exchange of two intervals with lengths say a, /3. The slope of the sturmian word, which we 
have denoted by e, is then equal to e = The term 'slope' for the parameter e comes from 

the fact that the sturmian word with slope e can be constructed by projection of points of the 
lattice to the straight line y = ex; it will prove convenient to abuse the language by speaking 
of the slope of the rotation, this slope is the complement to 1 of the more usual angle of the 
rotation. Since the sturmian word does not depend on the absolute lengths of the two intervals 
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being exchanged but one their ratio, then the lengths are often normahzed to satisfy a + P = 1. 
In this case a and e coincide. 

The same situation appears for 3iet words which code exchange of three intervals with lengths, 
say, a, /?, 7. The commonly used normalization of parameters is a + /? + 7 = 1. However, much 
more suitable appears to be the normalization a + 2/3 + 7 = 1. Let us mention three arguments 
in favor as results of papers [H [Sj [131 E!- If ''^ is an infinite word coding exchange of three 
intervals of lengths a, /3, 7, then: 

• the infinite word u is aperiodic if and only if ^-J^ — ^ Q; 

a + 2/i( + 7 

• if n is assumed to be aperiodic, then u codes a non-degenerate exchange of three intervals 

if and only if " + ^ + ^ ^ Z + Z "+/ ; 

^ Q + 2/? + 7 ^ a + 2/3 + 7' 

• the infinite word u can be constructed by projection of points of the lattice on the 

straight line y = ^-J^ — x. 

a + 2/? + 7 

We will give in Section [3] and |4] a short proof of these facts by recalling that such an exchange 
of three intervals can always be obtained as an induced map of a rotation (exchange of two 
intervals) on an interval of length a+2/3+7; in the process, we will give a complete combinatorial 
characterization of 3iet words, as follows: 

Theorem A. Let u be a sequence on the alphabet {A, B, C} whose letters have positive densities. 
Let a : {A, B, C}* {0, 1}* a and a : {A, B, C}* {0, 1}* be the morphisms defined by 

(t(A) = 0, a{B)= 01, a{C) = l 

a'{A) = 0, a'{B) = 10, a'{C) = 1. 
The sequence u is an aperiodic 3iet word if and only if cr(n) and (t'{u) are sturmian words. 

This paper adds yet another argument supporting the normalization a + 2/3 + 7 = 1, by the 
following necessary condition: 

Theorem If a non- degenerate 3iet word is invariant under a primitive substitution, then 

e := "+/ ^s a Sturm number. 
a + 2/3 + 7 

Remark that, in that case, the corresponding homogeneous sturmian word is also substitution 
invariant. A forthcoming paper [7| will give a complete characterization of substitution invariant 
3iet words. Note that it is a natural question to ask wether, when n is a substitution invariant 
3iet word, one or both of the sturmian words cr{u), (t'{u) are also substitution invariant. 

This paper is organized as follows. The introductory notation and definitions are given in 
Section [2} Section [3] and |4] are devoted to the description of a classical exduction process in terms 
of substitutions and to the proof of Theorem A. Section |5] and Section [6] gather the required 
material for the proof of Theorem B, namely, properties of translation vectors and balance 
properties. Theorem B is proved in Section [7} 

2. Preliminaries 

We work with finite and infinite words over a finite alphabet A = {oi, . . . , a^}. The set of 
all finite words over A is denoted by A* . Equipped with the binary operation of concatenation 
and the empty word, it is a free monoid. The length of a word w = W1W2 • • • is denoted by 
\w\ = n, the number of letters Oj in the word w is denoted by \w\ai. 
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An infinite concatenation of letters of A forms the infinite word u = ('Un)neN) 

U = UQU1U2 • • • • 

A word w is said to be a factor of a word u = (tin)neN if there is an index i G N such that 
w = ttjttj+i • • • Ui+n-i- The set of all factors of u of length n is denoted by Cn{u). The language 
C{u) of an infinite word u is the set of all its factors, that is, 

C{u) = J Cn{u) . 

jieN 

The (factor) complexity Cu of an infinite word u is the function : N ^ N defined as 

Cu{n) := #Cn{u) . 

The density of a letter a € A, representing the frequency of occurrence of the letter a in an 
infinite word u, is defined by 

#{i I < i < n, Ui = a} 
p[a) := iim , 

if the limit exists (this is always the case for 3iet words, as is easy to prove). 

Let A and B be two alphabets. A mapping ip : A* ^ B* is said to be a morphism if 
ip{ww) = ip{w)ip{w) holds for any pair of finite words w,w £ A*. Obviously, a morphism is 
uniquely determined by the images (p{a) for all letters a € A. li A and B coincide and if the 
images of the letters are never equal to the empty word, then ip is called a substitution. 

The action of a morphism p can be naturally extended to infinite words by the prescription 

ip{u) = ip{uoUiU2 ■■■)■■= ip{uo)ip{ui)ip{u2) ■■■ . 

A infinite word u G is said to be a fixed point of the morphism ip if ip{u) = u. 

The incidence matrix of a morphism over the alphabet A is an important tool which brings 
a lot of information about the combinatorial properties of the fixed points of the morphism. It 
is defined by 

{M^)ij = \p{ai)\aj = number of letters aj in the word ip{ai) . 

A morphism (p is called primitive if there exists an integer k such that the matrix is positive. 

Assume that an infinite word u over the alphabet A = {ai, . . . , a^} is a fixed point of a 
primitive substitution (p. It is known [19] that in such a case the densities of letters in u are well 
defined. The vector 

Pu = {p{ai),...,p{ak)) . 

is a left eigenvector of the incidence matrix M^p, i.e., p^Mip = Ap^. Since the incidence matrix 
is a non-negative integral matrix, we can use the Perron-Frobenius Theorem stating that 
A is the dominant eigenvalue of M^. Moreover, all eigenvalues of M^p are algebraic integers. 

3. Exchanges of three intervals as induction of rotations 
Let a,/?,7 > and denote by 

lA-=[0,ct), lB-=[a,a + P), Ic ■= [a + P,a + P + j), and I := Ia^ Ib^ Ic , 
and let iyi = /3 + 7, = 7 — a, tc" = —a — /? G M be translations vectors; we have: 

IaUIb'JIc = {I a + tA) U {Ib + tB) U {Ic + tc) ■ 

The map T defined on / by T{x) = x + tx i^ x G Ix, X = A, B,C is the exchange of three 
intervals Ia, Ib, Ic with the permutation (321). 
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As was already known long time ago (see ^]), this map can be obtained as the induction of 
a rotation on a suitable interval. We recall the construction; let Id = [a + /? + 7, a + 2/? + 7), 
and define J = I U Id- Let R be the rotation of angle ^^2i3+'y °^ defined by R{x) = x + /3 + 7 
if X G /a U Ib, and R{x) = x — a — /? if x G /c U J^; it exchanges the two intervals I a U Ib and 
Ic U Id- 

For a subset of X, the /irst return time rE^x) of a point x G -E is defined as min{n > | 
R"x £ E}. If the return time is always finite, we define the induced map or first return map of 
Ron E by Re{x) = i?'"^^(^)(x). 

Lemma 3.1. The map T is the first return map of R on I. 

Proof. One checks that R{Ia) = b+P, a+7+/3) = T{Ia), R{Ic) = [0, a) = T{Ic), R{Ib) = Id, 
and R^{Ib) = R{Id) = T{Ib)- □ 

Hence T can be obtained as induction of a rotation on a left interval (for more details, see 
e.g. [H [13] or the survey [9]). It can also be obtained as induction on a right interval, and this 
remark will prove important below: define Ie = [—(5,0), and J' = Ie^ I', consider the rotation 
R' on J' by the same angle ^^2/3+7 ' ™ same way, one proves that T is obtained as the first 
return map of R' on I. 

The underlying rotation R turns out to play an important role in the study of T; this explains 
the appearance of the number e = in the introduction: it is the slope of the rotation R. 

Notation 3.1. From now on, we will take the normalization a + 2/3 + 7 = 1. 

This amounts to normalize the interval of definition of ii to 1, and will greatly simplify the 
notation below. 

4. Characterization of non-degenerate 3iet words 

4.1. From 3iet words to sturmian words. With an initial point xq G / we associate an 
infinite word which codes the orbit of xq under T with respect to the natural partition in three 



intervals (see Definition 4.1 below). It turns out to be useful to shift the interval of definition, so 
that the free choice of the initial point xq for the orbit is replaced by the choice of a parameter 
that we call c as the position of the interval. The initial point for the orbit thus can always be 
chosen as the origin. For this we introduce the new parameters 

e := a + 13 , I := a + (3 + ^ , c:=—xq. 

The number e is the slope of the underlying rotation R, and / determines the length of the 
induction interval J. It is obvious that the above parameters satisfy 

(2) e G (0, 1) , max(e, 1 - e) < / < 1 , -/ < c < 0. 

We redefine in this setting five intervals 

Ia = [c,c + a), Ib = [c + a,c + e), Ic = [c + e,c + I), Id = [c + l,c + 1), Ie = [c - P,c). 

We define I = Ia^ Ib ^ Ici the map T (introduced above in Section |3]) is defined on / as the 
exchange of three intervals Ia,IbtIc according to the permutation (321). 
We also define 

Jo = Ia^ Ib, Ji = /c U Id and J = Jq U Ji 
Jq = U Ia, J'l = Ib ^ Ic and J' = JqU J[. 
The rotation R (resp. R') is then defined on J (resp. J') by the exchange of Jq and Ji (resp. 
Jq and J[); it has angle 1 — e, Jq and Jq have length e, whereas Ji and J( have length 1 — e. 
Let us formulate the definition of 3iet words with the use of these new parameters. 
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Definition 4.1. Let e, Z,c G M satisfy ([2|. The infinite word {un)n<=N defined by 

( A if r"(0) G Ia , 
(3) Un={ B if r"(0) G Ib , 

[ c if r"(o) G ic 

is called the 3iet word with parameters e, I, c. 

There is classical and simple way to give a combinatorial interpretation of the induction 



process of Lemma 3.1 in terms of substitutions. Consider indeed the orbit (T"(0))neN of 



under T; it is clear by Lemma 3.1 that it is a subset of the orbit (i?"'(0))„GN under i?; the points 
of the second orbit which are not in the first are exactly the points in and their preimages 
are exactly the points in Ib] the return time of these points to I is 2. Let u be the coding of 
the orbit of under T with respect to the partition in three intervals Ia,Ib,Ic'i to obtain the 
coding of the orbit of the same point under R, with respect to the partition Ia, Ib, Ic^ Id, this 
argument shows that it is enough to introduce a letter D after each B, that is to replace B by 
BD; to obtain the natural sturmian coding with respect to the partition Jq, Ji, we then project 
letters A,B to and C,D to 1. 

Definition 4.2. We denote by a (resp. a') the morphism from {A,B,C}* to {0,1}* defined by 
(j{A) = 0, a{B) = 01, a{C) = 1 (resp. a' {A) = 0, (j'{B) = 10, a'{C) = I). 

We thus have proved the following: 

Lemma 4.3. Let u be the coding of the orbit of under T, with respect to the partition Ia,Ib, Icj 
and let v (resp. v' ) be the coding of the orbit of under R (resp. R') with respect to the partition 
Joy Ji (resp. Jq, J[). Then v = cr(n), v' = a{u'). 

This implies that, if e is irrational, then a{u) and cr{u') are sturmian sequences whose density 
of equals e. 

4.2. Characterization theorem. We will now prove the reciprocal (Theorem A below); we 
need some properties of sturmian sequences. 

Let u be a sturmian sequence that codes the orbit of a rotation of angle 1 — e modulo 1 with 
density of equal to e, and let Vn be the prefix of v of length n, i.e., Vn = vq - ■ ■ Vn-i- Define a 
map 

/ : {0, 1}* ^ M by fiV) = \V\o{l - e) - \V\ie. 
From the definition, we see that 

Vn, /(K) = iKlo - ne, 

hence the sequence (/(V'n))neN is the orbit of under a rotation defined on an interval [c, c + 1), 
with c = iiai{f{Vn)\n G N}; in particular, we have, for all integers i,j, \f{Vi) — f{Vj)\ < 1. 
We have the following lemma: 

Lemma 4.4. Let v be a sturmian sequence, and let {nk)keN be a strictly increasing sequence of 
integers that satisfies Vnf,-i = 0, Vn,^ = 1. Define a new sequence v' by: v'^^_i = 1, v'^^ = 0, 
v'- = Vi otherwise. The sequence v' is sturmian if and only if for every i which is not in the 
sequence {nk)keN, and for all j, we have f{Vnj) > f{Vi). 

Proof. For all n, let stand for the prefix of v' of length n. We have f{Vi) = f{V-), except if 
i = Hk, in which case one checks that f{V^^) = /(V^^.) — 1. 

We first assume that v' is sturmian. Suppose that /(V^) > fiVn^), for some with i not in 
the sequence {nk)k<^N', then we must have f{V,-) > f{V^.) + 1; but this is impossible since v' is a 
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sturmian sequence with same density of O's as v. Hence for every i which is not in the sequence 
{nk)k&<i, and for ah j, we have f{Vi) < f{Vn^). 

Conversely, we assume that for every i which is not in the sequence (nfc)^^^, and for all j, we 
have f{Vnj) > f{Vi). One checks that for all integers \f{V-) — f{Vj)\ < 1. Indeed, this is 
immediate if i and j belong simultaneously to {nk)k£N, or else if none of them belongs to this 
sequence. If i is not in the sequence {nk)km, then - /(K^)! = 1/(^0 - /(KJ + 1| < 1. 

We deduce that the sequence v' is a balanced sequence. Indeed, take two factors W and W' of 
the same length n of the sequence v' that occur respectively at index i and j. One has 

ll^lo - \w'\o\ = - /(l^+J) - (/(y/) - f{Vl))\ < 2. 

We deduce that the densities of letters are well-defined in v' . By construction, they coincide 
with the densities of letters for the sequence v, hence v' is an aperiodic balanced sequence, it is 
thus a sturmian sequence, according to [T5]. □ 

We are now in position the prove the first theorem: 

Theorem A. Let u be a sequence on the alphabet {A, B, C} whose letters have positive densities. 
This sequence is an aperiodic 3iet word if and only if a{u) and cf'{u) are sturmian words. 



Proof. We have proved above (Lemma 4.3) that the condition is necessary. Let us prove it is 
sufficient. Let v = a{u),v' = u'{u) be the two sturmian words; by construction, they have the 
same slope e, and they coincide except on a sequence of pairs of indices (n^ — 1, n^), corresponding 
to the images of i?, where is replaced by 1 and vice versa. 

Define the function / as above, and define c = inf{/(Vfc) | k G N}, / = inf{/(y„j.) | A; e N} — c. 
From Lemma |4.4[ we deduce that an index i is of the form Uk if and only if /(Vi) > Z + c if 
inf{/(y„j.)} = min{/(^j.)} (resp. fiVi) > / + c otherwise). Then, one checks that the sequence 
u is generated by the exchange T of the three intervals Ia,Ib: Ic with either 

Ia = [c,c + £ + 1 - I), /b = [c + e + / - 1, c + e), Ic = [c + e,c + l) 

or 

/a = (c, c + e + / - 1], Ib = {c + £ + 1 -l,c + e], /c = (c + e, c + /], 

the choice of the intervals being determined by the values of and thus of v and v' ^ at the indices 
(if any) where the orbit of under T meets discontinuity points. The interval Ib corresponds 
to the times Uk — l for the sequence v, and Ia and Ic resp. to value and 1 for the other times. 
We deduce that u is aperiodic from the irrationality of e. □ 

Figure [T] gives a geometric interpretation of the proof; to the 3iet word u, we have associated 
a stepped line (bold line), by associating letter A to vector (1,0), B to (1,1), and C to (0,1). 
Remark that this stepped line is contained in a "corridor" of width less than 1; in dashed lines 
are shown the two sturmian lines associated to v and v' , obtained by enlarging the corridor on 
the right or the left to the width of the unit square. 

4.3. Complexity. It is known that the factor complexity of the infinite words ^ satisfies 
C(n) < 2n + 1 for all n G N. A short proof can be given by considering the partition V in three 
intervals; to count the number of factors of length n, it is enough to count the number of atoms 
of the partition M^zIt^^V. But it is easy to prove that these atoms are intervals, bounded 
by reciprocal images of the two discontinuity points. As there can be at most 2n such points 
between time and n — 1, there are at most 2n-|-l intervals. 

The infinite words (un)n6N which have full complexity are called non- degenerate (or regular) 
3iet words; 3iet words for which there exists n such that C{n) < 2n + 1 are called degenerate. 
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Figure 1. The stepped line associated to a 3-iet word and its two sturmian extensions 



The necessary and sufficient condition for a word (n„)„gN coding 3iet to be non-degenerate 
is the so-called i.d.o.c. (infinite distinct orbit condition). This notion has been introduced by 
Keane [XT' and requires, in this case, that the orbits of the two points of discontinuity of the 
transformation T are disjoint, formally {T"(c + I — 1 + e)}„gN n {T"(c -|- e)}„gN = 0. If this 
condition holds true, then the partition above is limited by exactly 2n points on the interval, 
hence has 2n + 1 atoms. The condition i.d.o.c is equivalent to 

(4) e^Q and l^Z + Ze =: Z[e] , 

see [Hill]. 

Remark 4.5. If e is irrational, it is classical that the rotation R is uniquely ergodic, which 
implies that T is also uniquely ergodic. In that case, the densities of letters in the 3iet aperiodic 
word are well defined and Qu is proportional to the vector of lengths of intervals Ia, Ib, Ic- 
If e is rational, the sequence u is periodic, hence the densities exist in a trivial way. 



5. Translation vectors 



Let tt be a 3iet word such as defined in Definition 4.1, In our considerations, the column 



vector of translations will play a crucial role. We denote it by 




A first remark is that the vector of translations is orthogonal to the vector of densities; this 
can be checked directly, and interpreted as the fact that the mean translation is 0, because the 
orbit under the action of the map T is bounded. 

We assume furthermore that u is fixed by some substitution f. We will now obtain a more 
subtle equation, using the substitution (p. Let us define a function g (in the flavour of the map 
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/ defined in Section 4.2) on the prefixes of the infinite word u, the fixed point of ip. For the 
prefix w = uqUi ■ ■ ■ Un-i, n > 0, we put 

^('Uoni • • • ?i„_i) := r"(0) = \w\AtA + \wB\tB + \wc\tc ■ 
In particular, the image of the empty word equals . For X £ {A, B, C}, put 

Ex '■= ^g{uoUi • • • Un-i) Un = x| = \w\b, {wlc) t wX is a prefix of ti| . 

Clearly, the closure of the set Ex satisfies Ex = Ix ■ 

The infinite word uoUiU2 ■ ■ ■ = (p{uo)ip{ui)ip{u2) • • • can be imagined as a concatenation of 
blocks ^{A), ip{B), (p{C). Positions, where these blocks start, and the corresponding iterations 
of r, are given by the following sets. For X G {A, B, C}, put 

E^(X) ■= \^a{v^{uoUi- ■ -Un-^i)) Un = X^ = ^g{ip{w)) luX is a prefix of . 
From the definition of the matrix iW^ it follows that 

(5) E^(^x) = \^i\'w\A,\MBAMc)M^t is a prefix of -uj . 
Obviously, 

E^{A) U E^^B) U E^^c) C {T"(0) I n G N} C / , 
and the union is disjoint. The fact that T^{0) belongs to E^p(^A) is equivalent to 

• UkUk+iUk+2 ■ ■ ■ has the prefix ip{A); 

• uqUi ■ ■ ■ Uk-i = ip{uQUi ■ ■ ■ Ui-i) for some i G N; 

• Ui = A. 

Similar statement is true for the elements of the sets E^pi^B) E^(^(jy Moreover, from the 
construction of if follows that if T'^(O) G then the smallest n > k for which 

r'^(O) G E^f^A) U E^(^B) U E^f^c) satisfies n-k = \ip{X)\. 

The infinite word u can therefore be interpreted as a word coding exchange of three sets 
E^(A), E^(^B), E^(c), with translations 

:= MX)\AtA + HX)\BtB + MX)\ctc ■ 

Obviously, one has 

{E^{A) + tip{A)) U {E^(^B) + tip{B)) U {E^(^c) + ^ifiiC)) = -E'^(A) U -E'^(B) U £;<^(c) C I. 

From the definition of it follows that the translation vector = (i^(yi)) ^v'(s)' ^"pCC))"^ 

satisfies 

(6) = M^t. 

6. Balance properties of fixed points of substitutions 

Definition 6.1. We say that an infinite word u = (tin)neN has bounded balances, if there exists 
< K < +00 such that for all n G N, and for all pairs of factors w,w €z Cn{u), it holds that 

\\w\a — \w\a\ < K , for all aGw4. 

The above definition is a generalization of the notion of balanced words, which correspond 
to a constant K equal to l.We have used the fact that aperiodic balanced words over a binary 
alphabet are precisely the sturmian words in the proof of Lemma 4.4 [15j . The balance properties 
of the considered generalization of sturmian words, the 3iet words, are more complicated. The 
following is a consequence of results in [1] . 
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Proposition 6.2. Let u be a 3iet word. Then u has bounded balances if and only if it is 
degenerated. 

In this paper we focus on substitution invariant non-degenerate 3iet words. We shall make 
use of the following result of Adamczewski [2^ , which describes the balance properties of fixed 
points of substitutions dependently on the spectrum of the incidence matrix. We mention only 
that part of his Theorem 13 which will be useful in our considerations. 

Proposition 6.3. Let the infinite word u be invariant under a primitive substitution ip with 
incidence matrix M^. Let A be the dominant eigenvalue of M^p. // |A| < 1 for all other 
eigenvalues A of M^, then u has bounded balances. 



7. Necessary conditions for substitution invariance of 3iet words 

We now have gathered all the required material for the proof of Theorem B which provides 
necessary conditions on the parameters of the studied 3iet words to be invariant under substi- 
tution. 

Theorem B. Let u = («n)neN be a non-degenerate 3iet word with parameters e, I, c satisfying Q 
and Q. Let ip be a primitive substitution such that (p{u) = u. Then the parameter e is a Sturm 
number. 

Proof. The density vector of the word u is the vector Qu = ~ "T^' \ ~^^^~t) ■ vector Qu is 
a left eigenvector corresponding to the Perron- Frobenius eigenvalue A. Since Qu is an irrational 
vector and M^p an integral matrix, has 3 different eigenvalues. Denote the other eigenvalues 
of M^p by Ai, A2 and by afi, X2 the right eigenvectors of the matrix corresponding to Ai and 
A2, respectively, i.e., 

(7) M^pXi = Xixi and M(^X2 = A2X2 • 

A left eigenvector and a right eigenvector of a matrix corresponding to different eigenvalues 
are mutually orthogonal. Therefore the vectors xi, X2 form a basis of the orthogonal plane to 
the left eigenvector corresponding to A. Since the vector t = (1 — e, 1 — 2e, —e)'^ is orthogonal 
to Qu^ we can write 

(8) t = lixi + 1^x2 for some fi,u £ C. 

Our aim is now to show that either /x = or 1/ = 0, i.e., that the vector t is a right eigenvector 
of the matrix M^. 

Recall that the translation vector t^, satisfies Since this holds for any substitution which 
has U0U1IL2 ■ ■ ■ for its fixed point, one can write 

(9) V = M^n t = Ml t. 

Since i(^n represents translations of subsets of a bounded interval I, the vector t^n must have 
bounded components. Combination of ([T]), (|8|, and (|9| leads to the fact that the sequence of 
vectors 

(10) Ml t = //A^fi + iyX^X2 
is bounded. 

We shall now distinguish two cases. Realize that the Perron eigenvalue A must be an algebraic 
integer either of degree three or of degree two. 
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The cubic case. Suppose that A is a cubic number. Then Ai, A2 are its algebraic conjugates. By 



assumption u is a non-degenerate 3iet word, and thus using Proposition 6.2 and Proposition |6.3| 
and the fact that Salem number^ of degree 3 do not exist, we derive that one of the eigenvalues 
Ai, A2 is in modulus greater than 1, say IA2I > 1. Boundedness of the sequence of vectors 
{M^ i)neN in (10) implies that z/ = and thus t is a right eigenvector of the matrix Mip, 



without loss of generality, we can put xi = t. 

But then the components of the vector xi = t = (1 — e, 1 — 2e, — e)"^ belong to the field Q(Ai), 
however, the first plus the last components of the vector xi are equal to the middle one, which 
gives a quadratic equation for Ai. This is a contradiction, hence that case is impossible. 

The quadratic case. We have shown that A is a quadratic number. In this case, the other 
eigenvalues of M^p are the conjugate Ai = A' of A and A2 = r G Z. Irrationality of the vector t 



implies that /i 7^ 0. Let us suppose that 7^ 0, as well. Boundedness of t in (|T0|) implies 



that |A'| < 1 and |r| < 1. By Proposition 6.3, we have \r\ > 1 and thus r = ±1. Without loss of 



generality, we can assume that r = 1, otherwise we consider the morphism (p'^ instead of 93. For 
the vector t^pn of translations of the sets E^n^j^^, -E'(p"(_B)) -E'</3"(c)) it holds that 

t<p" = /x(A')"xi + 1/X2 — > 1^x2 / . 

n— >oo 

We shall make use of the following property of infinite words coding 3iet. For arbitrary factor 
w £ C{u) denote by Iw the closure of the set {r"'(0) | ti; is a prefix of UnUn+iUn+2 " " " }• It is 
known that lyj is an interval. With growing length of w, the length |/^| of the interval 1^ 
approaches to 0. Since the morphism 93 is primitive, the length ip'^{X) grows to infinity with 
growing n for every letter X. Obviously E^nf^x) C Ip,n(^x) and lim„^oo = 0- 

Recall that E^n(^^-^, E^n(^Q-j are disjoint and their union is equal to (i?^n(^) +*<^"(A)) U 

{E^n(^B) + tip"{B)) U (-E'^"(c) + *v'"(C))- Since by assumption lim„^oo = 1^x2 / 0, for 
sufficiently large n, one of the following is true: 

- either there exist X,Y € {A, B,C}, X ^ Y such that 

Eip"{X) = E^"{Y) + ' 

- or for mutually distinct letters X, Y, Z of the alphabet we have 

ElP^{X) U E^n(^z) = l^tp"{Y) + ip"{Y) ■ 

This would however mean for the densities of letters that q{Z) = g{X) = q(Y), or q(Y) = 
g{X) + g{Z), respectively. This contradicts the fact that n is a non-degenerate 3iet word. Hence 
the assumption ly ^ leads to a contradiction. 

Thus by (|8|, the vector t is a right eigenvector of the matrix corresponding to the 
eigenvalue A'. 

Since A is a quadratic number, e is also a quadratic number and A G Q(e') = Q(e), where e' 
is the algebraic conjugate of e. Applying the Galois automorphism of the field Q(e) we obtain 
that the vector t' := (1 — e', 1 — 2e', —e')'^ is a right eigenvector corresponding to A, i.e., it has 
either all components positive or all negative. Therefore we have (1 — e')e' < 0, which means 
that e is a Sturm number. □ 

The proof of Theorem B provides several direct consequences. 



^An algebraic integer is called a Salem number, if all its algebraic conjugates are in modulus < 1 and at least 
one of them lies on the unit circle. It is known [Tl] that all Salem numbers are of even degree greater than or 
equal to 4. 
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Corollary 7.1. Let u = (ttn)neN be a non- degenerate 3iet word with parameters £,l,c satisfy- 
ing ([2| and Q. Let if be a primitive substitution such that <f{u) = u. Then 

• the incidence matrix of (p is non-singular; 

• its Perron- Frobenius eigenvalue is a quadratic number A G Q(£); 

• its right eigenvector corresponding to A is equal to (1 — e',1 — 2e', —e'Y', where e' is the 
algebraic conjugate of e. 

Another consequence of the proof of Theorem [7] is that the Perron- Frobenius eigenvalue of the 
incidence matrix M^p of the substitution (p under which a 3iet word is invariant is an algebraic 
unit. Before stating this result, realize that since t is an eigenvector of ]Sl^p corresponding to A', 
the definition of the set Ex and the equation ^ imply 

(11) ^^(X) = ^'Ex . 

In accordance with the definition of translations tx and t^{X) for a letter X in the alphabet 
A = {A, B, C} we can more generally introduce the translation t^ for any finite word w £ C{u), 
as 

tw ■= \w\AtA + \w\BtB + \w\ctc ■ 

With this notation, we can describe several properties of the sets -E^(x) where w is a proper 
prefix oi ip{X), X £ A. (The number of these sets is |(/3(^)| + |(/?(i?)| + |(/?(C)|.) The substitution 
invariance of u under 99, uqUiU2 ■ ■ ■ = ip{uo)ip{ui)(f(u2) • • • , implies the following facts. 

(1) E^^x)+tn^ = T\^KE^(x))- 

(2) The sets -E'^(x) + ^wi where w is a proper prefix of (p{X), X £ A, are mutually disjoint. 

(3) For any letter X £ A and for every proper prefix w of ^p{X), there exists a letter Y £ A 
such that E^(^x) + iw Ey- 

(4) U U {E^(x)+tv?j = EaUEbUEc. 

Corollary 7.2. Let u = (n„)„gN be a non- degenerate 3iet word with parameters e,l,c satis- 
fying ^ and Q. Let ip be a primitive substitution such that ip{u) = u. Then the dominant 
eigenvalue of the incidence matrix of ip is a quadratic unit and the parameters c, I belong 
to Q(e). 

Proof. We know already that the Perron- Frobenius eigenvalue A of the matrix iW^ is a quadratic 
number. For contradiction, assume that A is not a unit. Since M^pt = A't, we have A'Z[e] C Z[e]. 
If A is not a unit, then A'Z[e] is a proper subset of Z[e] and the quotient abelian group Z[e]/A'Z[e] 
has at least two classes of equivalence. For the purposes of this proof we shall denote by <J the 
left end-point of a given interval J. 

Reahze that Ex C Z[e], E^(x) = ^'Ex C A'Z[e] and Ex <t A'Z[;r] for all X £ A. Facts 
(2) — (4) above imply that the left boundary point of the interval i-e., the point o/^ = c 
must coincide with <\{E^p{x{) + and <{E^p(X2) +*«'2)) some letters X\^X2 £ A and some 



prefixes w\,W2 of ip{X\)^ (p{X2)^ respectively. The above property (1) and equation (11) imply 

Since T'^{x) 7^ x for all n 7^ and all x G /, we necessarily have X\ 7^ X2. Same reasons 
imply for the left boundary point of the interval Is , that there exist at least two distinct letters 
Yi Y2, such that <(^E^p(Y-) + ty-) coincide with <\Ib = c -\- 1 — {1 — e) for some proper prefixes 
Vi of ip(Yi). 
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Since the distance I — 1 + e between ol^ and <Ib is not an element of we must have 
Yi 7^ Xj for i,j = 1,2. This contradicts the fact that the alphabet has only 3 letters. Therefore 
A' is a unit. 

The fact that <Ia = c, <Ib = c + / — 1 + e, <Ic = c + e coincide with iterations of points A'c, 
A'(c + / - 1 + e), A'(c + e) implies that c, / G Q(e). □ 
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